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HETEROLOGOUS PROTEIN PRODUCTION SY STEM 
USING AVIAN CELLS 



BACKGROUND OF THE INVENTION 
5 1 . Field of the Invention 

The present invention relates to novel expression systems that 
can produce biomedically important heterologous proteins including 
human erythropoietin (hereafter "EPO"), and more specifically to the 
production of various heterologous proteins by transfecting DNA 
10 encoding the proteins, such as the genomic DNA encoding EPO into 
avian cells. 

2. Related Arts 

l~Many recombinant proteins used in medicine are relatively small 
and simple in their structure, and biologically functional proteins can be 

15 produced in prokaryote such as £. coli. However, some human 
proteins of medical interest, such as TPA (tissue plasminogen 
activator), Factor VIII, EPO, etc. are more complicated because 
biological function requires post-translational modification, j For 
example, EPO is extensively glycosylated with the carbohydrate portion 

20 accounting for 40 % of the molecular mass. It has been shown that 
the carbohydrate portion of EPO is important for biological function. 
Accordingly, EPO produced in E. coli, yeast or insect is inactive or very 
weakly active in vivo, while EPO produced in COS or CHO cells was 
found to be fully active. Accordingly, those kinds of heterologous 

25 #coietQS have been produced only in mammalian, cells. 

In the meantime, the avian system has been used for the study 
of gene expression in higher eukaryote for a long time. One of the first 
viruses to be linked to tumors was the Rous sarcoma virus of chicken, 
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and this virus was instrumental in demonstrating that the retroviral 
oncogene can originate from a cellular gene, leading to the concept of 
the protooncogen. Studies of gene expression have also been done 
using the RSV LTR promoter, which has often be used for high level 
5 expression of heterologous genes in mammalian cells. In addition, 
avian embryo cells have been used extensively in studies of various 
animal viruses. 

SUMMARY OF THE INVENTION 

The present invention is a research for the high level expression 
10 of eukaryotic heterologous proteins. It is an object of the present 
invention to provide a novel heterologous gene expression system 
which can produce proteins of higher eukaryotic cells. It is another 
object to provide the method of efficiently producing higher eukaryotic 
proteins, such as EPO, etc., which has been known to be active only 
15 when they are produced in a mammalian cell. It is a further object of 
the invention to provide the method of producing, especially, EPO 
among the eukaryotic proteins described above. 

To accomplish the objects of the present invention, the present 
invention provides a heterologous gene expression system comprising 
20 a DNA encoding a heterologous protein, a vector for receiving the DNA; 
and an avian cell for harboring the vector. 

The present invention also provides a method of producing a 
heterologous protein comprising the steps of culturing the expression 
system of claim 1 in media to express the heterologous gene, and 
25 purifying the heterologous proteins from the cell and the media. 

Preferably, the heterologous protein of the present invention is 
selected from the group consisting of those proteins that are known to 
be active only when expressed in mammalian cells (such as EPO, TPA, 
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Factor VIII, etc.) and preferably, the vector contains a promoter 
selected from the group consisting of SV early promoter, major 
immediate early promoter of human cytomegalovirus (hereafter "HCMV 
MIEP") and RSV LTR, and preferably, the avian cell is selected from 
5 the group consisting of duck embryo cell (hereafter "DE"), chicken 
embryo fibroblast (hereafter "CEF") and quail fibrosarcoma (hereafter 
"QT"), more preferably QT-VC which was isolated by the inventors. 
QT-VC was deposited to the International Depository Authority, Korea 
Research Institute of Bioscience and Biotechnology Korean Collection 
10 for Type Culture, and assigned a deposit number of KCTC 0277BP on 
August 22, 1996. The deposited QT-VC was transfected with the 
expression vector containing SY-EPO cDNA as described in Fig. 8. 

More preferably, the DNA encoding the heterologous protein is 
genomic DNA or cDNA. 

15 Further, the present invention provides an EPO production 

system comprising a DNA encoding EPO, a vector for receiving the 
DNA, and an avian cell for harboring the vector. 

Moreover, the invention provides a method of producing EPO 
comprising the steps of inserting a DNA encoding EPO into a vector, 
20 transfecting the vector into an avian cell, and culturing the transfected 
avian cell in media. 

Preferably, the avian cell of the EPO production system is DE or 
QT, and the DNA is a genomic DNA encoding EPO, more preferably, 
the DNA selected from the group consisting of SY, JM, SH and HE 
25 described in Fig. 5. 

Preferably, the vector has a promoter selected from the group 
consisting of SV early promoter, HCMV MIEP and RSV LTR. 

The present invention also provides an avian cell as a host for 
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expressing genes encoding mammalian proteins. 

Further, the present invention provides an novel EPO genomic 
sequence selected from the group consisting of SY, JM, SH and HE 
described in Fig. 5, and also provides an novel EPO amino acid 
5 sequence selected from the group consisting of JM, SH and HE 
described in Fig. 6. 

BRIEF DESCRIPTION OF THE DRAWINGS 

JFic^J^ shows the expression of the bacterial CAT gene in avian 
cells. DE and CEF cells were transfected with pRc/CMV containing 

10 (+) or lacking (-) the CAT sequence. CAT activity was measured by 
determining the amount of acetylated chloramphenicol (AC) produced 
from ^Chloramphenicol. The values shown are from one 
representative of more than five independent assays. For this 
particular experiment 10 \ig of protein was reacted with 14 C- 

15 chloramphenicol for 20 min at 37 °C. 

Fig. 2 shows the comparison of CAT gene expression between 
various cell types and between different promoters. The three 
promoter-CAT fusion constructs were transfected into DE, CEF, CHO- 
K1 , and HeLa cells, and CAT activity was measured as described in 
20 Fig. 1. S, SV40 early promoter; C, HCMV MIEP; R, RSV LTR. The 
values shown are from one representative of three independent 
assays. For this particular experiment, 10 |ig of protein was reacted 
with 14 C-chloramphenicol for 30 min at 37 °C . 

^Fig^3. shows the efficiency of DNA transfection in various cells. 
25 pCMV-lacZ constructs was transfected into DE, CHO, Vero, HeLa, 
and 293T cells by calcium phosphate-DNA coprecipitation using the 
conditions used for the experiments shown in Fig. 2. Two days after 
transfection, cells were fixed and stained with X-gal. The number of 
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blue cells per 60 mm tissue culture plate was counted. The total 
number of cells between plates were comparable at 1-3 X 10 5 . 
Transfection efficiency was calculated relative to DE cells. 

Fig. 4 s hows the schematic diagram for cloning of human EPO 
5 and construction of expression vectors. The five blocks represent the 
five coding regions of EPO. The first PGR was performed using 
primers 25 and 33. The amplified DNA fragment was cloned and 
subjected to a second PCR using primers 12 and 9. The wavy tale in 
primer 12 contains the nucleotide sequence from the first coding region. 
10 Therefore, the second PCR generates the entire coding sequence of 
EPO so that the first and the second coding regions are attached to 
form without intron between them. Primers 12 and 9 contain Hindlll 
linkers at their 5 1 ends, enabling cloning of the EPO genomic sequence 
into various expression vectors. 

15 Fig. 5 is various EPO genomic DNA sequences. SY, SH, HE 

and JM are the EPO genomic DNA sequences cloned by the present 
invention, and AM and Gl are the EPO genomic sequences which has 
been already reported. Since the intron between the first coding 
region and the second coding region was deleted during the cloning, 

20 the deleted intron is not shown in Fig. 5. 

Fig. 6 is various EPO amino acid sequences. SY, SH, HE and 
JM are the EPO amino acid sequences cloned by the present invention, 
and AM and Gl are the EPO amino acid sequences which have been 
already reported. The abbreviation of the amino acids are as follows: 

25 A: alanine R: arginine N: asparagine D: aspartic acid 

C: cystein Q: glutamine E: glutamic acid H: histidine 

I: isoleucine L: leucine K: lysine M: methionine 
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F: phenylalanine P: proline S: serine 

T: threonine W: tryptophan Y: tyrosine V: valine 

Fig^JLshows the comparison of CAT gene expression between 
QT-VC and other mammalian cell lines. pCMV-CAT was transfected 
5 to QT-VC; CHO-K1, and Vero cells, and CAT activity was measured as 
described in Fig. 1. The transfection efficiency, as measured by X-gal 
staining following cotransfection with pCMV-IacZ, was reproducibly 3- 
5 % in all cases. For this particular experiment, 50 |ig of protein were 
incubated with 14 C-chloramphenicol for one hour at 37 °C. 

10 Fi g. 8 i s the typical structure of a plasmid used to express EPO 

in QT cells. The two types of BamHI cassettes which could express 
the gene for human glutamine synthetase (GS) was made. In these 
BamHI cassettes, the GS cDNA sequence was flanked by the poly A 
sequence from the bovine growth hormone gene and one of the two 

15 promoters, the partial MMTV LTR (from -220 to +1 5 from the RNA start 
site) or the 220 bp HSV tk promoter. The BamHI fragment expressing 
GS was inserted into the BamHI site of pCI-neo (Promega, Madison, 
Wl, USA), resulting in a series of pIGA. The Hindlll fragment of the 
SY-EPO cDNA sequence was cloned into the Smal site of pIGA, 

20 generating the EPO expression vector, pIGA-EPO. 

Fig^shows the production of EPO by QT-N4D4. QT-N4D4 
cells were grown to confluence in a 10 cm culture dish (day 0) in M-199 
containing 10 % FBS and 1 mM MSX. On day 3, the EPO level was 
measured. The cells were then split into 1:3 and seeded onto 10 cm 
25 dishes. On day 6, the cells were again reached confluence, and the 
medium was replaced with 10 ml fresh medium containing 2 % (•) or 
1 0 % (O) FBS. EPO levels were determined by ELISA (R&D system, 
Minnesota, USA) 
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^Fig^J^ shows the comparison of EPO concentration in DE (#) 
and QT-N4D4 (O) measured by ELISA and by in vitro bioassay. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The inventors have explored the possibility of using avian cells 

5 as a host cell for heterologous gene expression. We have chosen to 
use three avian cells; two embryonic cells from chicken and duck, and a 
quail fibrosarcoma line. We chose to use the chicken and duck 
embryo cells for the following the reasons. First, these embryonic 
cells can easily be prepared from eggs, and they divide rapidly, 

10 undergoing many passages. Second, chicken and duck cells can be 
grown at large scale with relatively low costs. Third, some avian cells, 
such as those from chicken embryos have already been used for 
medical products. For example, influenza virus has been cultured in 
chicken eggs for the production of vaccines. Finally, the culture 

15 conditions, including media and temperature, required by avian embryo 
cells are virtually identical to those of mammalian cells, suggesting that 
the physiology of avian and mammalian cells is probably comparable. 

Further, the reason of choosing a QT cell line is that various 
transformed cell lines have been already developed, and it is easy to 

20 handle these cell lines to construct a permanent cell line expressing a 
heterologous protein, and the culture conditions and media is similar to 
those of mammalian cells. 

I. Cells and Plasmids 

1. Cells 

25 The following Table 1 shows cells used in the experiment. 
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Table 1 



Cells 


Source 


HeLa human cervical carcinoma cells 


ATCC CCL2 


Vero African green monkey kidney cells 


ATCC CCL81 


COS-7 African green monkey kidney cells 
transformed by wild-type T antigen of 
SV40 


ATCC CRL1651 


CHO-K1 Chinese hamster ovary cells 


ATCC CCL61 


NIH3T3 contacted-inhibited Swiss mouse 
embryo cells 


ATCC CRL1651 


Ad-5 transformed human embryonic 
kidney cells 293 


ATCC CRL1651 


SL-29 chicken embryo fibroblast cells 


ATCC CRL1590 


Duck embryo 


ATCC CCL141 
or prepared by the 
inventors 


Quail fibrosarcoma line QT6 


ATCC CRL1708 


Quail fibrosarcoma line QT-VC 


Isolated by the inventors 
KCTC 0277BP 



All these cells except QT cell lines were grown in Dulbecco's 
modified Eagle's medium (DMEM) supplemented with 10% fetal bovine 
serum (FBS). QT cell lines were cultured in M199 medium instead of 
DEME. Duck embryo was either obtained from ATCC CCL 141 or 
prepared by trypsinization of 10- to 13- day old decapitated duck 
embryos. These avian cells were grown in minimum essential medium 
(Eagle) supplemented with non-essential amino acids and Earle's 
balanced salt solution containing 10 % FBS. These cells could be 
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maintained for approximately another 30 passages. Each medium 
used in this study was supplemented with 120 jig/ml penicillin G 
(Sigma P-3032; 1690 units per mg) and 200 |xg/ml streptomycin (Sigma 
S-9137; 750 units per mg). 

5 2. Plasmids 

To evaluate the efficiency of heterologous protein production in 
avian cells, pRc/RSV-CAT and pRc/CMV-CAT were constructed by 
inserting a Hindlll CAT cassette (Pharmacia, Piscataway, NJ) into the 
Hindlll sites of pRc/RSV and pRc/CMV (Invitrogen, San Diego, 

io California, USA), respectively. For pSVCAT, the plasmid p9l8 was 
used, which has been already described by the inventors. For EPO 
expression vectors, three vectors were used. pCMV-gEPO was 
constructed by cloning the Hindlll fragments of the EPO genomic 
sequence into the Hindlll site of pRc/CMV. pSV-gEPO was derived by 

15 replacing the CAT sequence of pSV918 with the genomic EPO 
sequence. pIGA-EPO has cDNA of EPO controlled by HCMV MIEP 
and the genes of NEO and glutamine synthetase (hereafter "GS"). To 
measure the transfection efficiency, the plasmid pCMV-lacZ was 
constructed by inserting bacterial lacZ fragment into the Hindlll site of 

20 pRc/CMV. 

II. DNA Transfection and Gene Expression Assays 

The inventors tested whether avian embryo cells could be used 
for high levels of heterologous gene expression instead of mammalian 
cells. Although avian embryo cells have been used to culture viruses, 
25 there was no report that heterologous proteins of higher eukaryotic cells 
were expressed in these cells. To carry out the study, it is necessary 
to develop the method of efficient transfection to avian cells. That is, 
to express heterologous genes in avian cells, it is required to develop 
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the transfection technique of DNA to target cells. At present, we could 
not find any reports on DNA transfection of avian embryo cells. 
Accordingly, the inventors have developed the technique that CEF and 
DE cells can readily be transfected with DNA. 

5 Among the techniques available, we have chosen a method 

using calcium phosphate coprecipitation, because this works well for 
various adherent cells and can also be used for establishing permanent 
lines. We have tested many different conditions and found that the 
following procedure was optimum. 

10 When cultures were 50-70% confluent in a 100 mm culture dish, 

a total of 10 jig DNA in HBS buffer (140 mM NaCI, 5 mM KCI, 0.75 mM 
Na 2 HP0 4 .2H 2 0, 6 mM dextrose, 25 mM HEPES) was incubated with the 
cells for 30 min at room temperature. 10 ml of regular media 
containing FBS was added and incubated for 20 hrs at 37 °C, except for 

15 CHO-K1 (8 hours). Cells were then treated with 10 ml of 100 \iM 
chloroquine, and incubated for another 3 hours at 37 °C. After 
replacement with 10 ml of fresh media, the cells were grown for 1 to 2 
days. Culture supernatants were collected and centrifuged at 1000 
rpm for 10 min to remove cells and debris. To measure transfection 

20 efficiency, cells were transfected with pCMV-lacZ, rinsed once with 
PBS 3 days after transfection, fixed with 0.5 % glutaraldehyde (in PBS) 
for 10 min, and washed twice for 2-10 min each with 4 ml PBS 
containing 1 mM MgCI 2 . For X-gal staining, the staining solution [PBS 
containing 4 mM K 3 Fe(CN) 6( 4 mM K4Fe(CN) 6 .3H 2 0, 2 mM MgCI 2) and 

25 400 ii g per ml X-gal (in dimethylformamide)] was added to fixed cells, 
and incubated at 37 °C for 4 hours overnight. When the reaction was 
completed, cells were washed once with PBS. Stained cells were kept 
in PBS. 
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CAT assay was carried out as follows: 

Two to three days after transfection, cells were harvested, 
washed once with PBS, and resuspended in 0.25 M Tris-HCI (pH 7.5). 
Total proteins were prepared by 4 cycles of freeze/thawing followed by 
heating at 65 °C for 7 min. Equivalent amounts of protein were 
assayed for CAT activity at 37 °C for 30 min. The amount of protein 
and the reaction time varied, depending on the experiments. For 
example, the CAT activity of cell extracts prepared from DE cells was 
so high that only 10 (ig protein and 20 to 30 min reaction time had to be 
used, and under this condition, levels of CAT activity in other 
mammalian cells were very low or undetectable. When CAT activity 
became detectable in other cells, virtually all 14 C-chloramphenicol was 
converted. The percent conversion of 14 C-chloramphenicol to its 
acetylated forms was determined by cutting out regions containing 
unreacted and acetylated forms and quantifying the amount of 
radioactivity in each by liquid scintillation counting. 

III. Gene Expression in DE and CEF 

Gene expression efficiency of DE and CEF was measured using 
CAT gene. We initially chose to use a promoter from the major 
immediate-early region of HCMV, because this has been shown to 
drive a high level gene expression in many different cell types. In the 
plasmid pCMV-CAT, the bacterial CAT gene is placed under the control 
of the HCMV MIEP. As a negative control, the plasmid Rc/CMV 
containing the promoter but no CAT sequence was used. These 
plasmids were transfected into DE and CEF cells and CAT activity was 
measured to estimate the efficiency of transfection and gene 
expression. One representative result from several independent 
transfections is shown in Fig. 1. Transfection of a control plasmid 
resulted in undetectable levels of CAT activity in both cells. However, 
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transfection with pCMV-CAT resulted in readily detectable levels of 
CAT activity in both cells. In more than five independent transfection 
assays, the level of CAT activity was always higher in DE cells than in 
CEF cells. The magnitude of difference in the level of CAT activity 
between the two cells ranged from 10- to 50-fold, depending on the 
experiment. This result indicated that avian cells were readily 
transfected with DNA and the heterologous genes could be efficiently 
expressed. 

IV. Comparison of Levels of Gene Expression between Avian 
and Mammalian Cells, and between Different Promoters 

We have compared the levels of gene expression between avian 
and mammalian cells, using three different promoters; 

(1) the SV40 early promoter, which is used during the early 
transcriptional phase of SV40 infection; 

(2) the HCMV MIEP, which drives the expression of IEI and IE2 
regulatory proteins, immediately after HCMV infection; 

(3) the RSV LTR from an avian retrovirus. 

These promoters are known to be powerful in mammalian cells, 
and have often been used for high level heterologous gene expression. 

These promoter-CAT fusion constructs were transfected into 
four different cell lines, DE, CEF, CHO-K1, and HeLa, and CAT activity 
measured to compare the efficiency of gene expression between 
promoters and between cell types. To make this comparison 
semi-quantitative, all transfections and CAT assays were performed at 
the same time and using identical conditions. One representative 
result of such experiments is shown in Fig. 2. Here, 10 \xg of cell 
extracts were incubated for 30 min in the CAT reaction. Under these 



particular conditions, the levels of CAT expression driven from the three 
promoters were very low in CHO and HeLa cells (Fig. 2). Only after 
larger amounts of proteins were used for extended reaction time, was 
CAT activity detected. In contrast, CAT activity was readily detectable 
5 in the avian cells (Fig. 2), except for the SV40 promoter in CEF cells. 
It indicated that the expression in avian cells are more effective than 
that in mammalian cells. 

The most dramatic finding was that the HCMV MIEP was 
extremely powerful in DE cells. In Fig. 2, the conditions used for the 

10 CAT reaction were chosen to generate the reasonable levels of CAT 
activity in other samples. When the CAT reaction was performed 
under limiting conditions for the protein sample prepared from DE cells 
transfected with pCMV-CAT (i.e., when the CAT conversion was below 
50 %), the levels of CAT activity of al! the other samples were virtually 

15 undetectable. Therefore, the magnitude of difference in CAT activity 
between the protein sample from DE cells transfected with HCMV-CAT 
and those from the other transfections is at least two orders of 
magnitude. These results suggested that heterologous genes could 
be expressed very efficiently under the control of the HCMV MIEP in 

20 DE cells. 

It is possible that the high levels of CAT expression seen in DE 
cells could be due to efficient transfection of the cell population, rather 
than an ability of these cells to support strong gene expression. To 
distinguish these possibilities, we transfected pCMV-lacZ into DE and 
25 various animal cells. After transfection, cells were stained with X-gal, 
and the number of blue cells were counted to estimate the transfection 
efficiency. As shown in Fig. 3, the number of stained cell was always 
comparable between DE and other animal cells, suggesting that the 
high levels of CAT expression in DE cells were due to high levels of 
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expression in individual cells. 

V. Cloning of human erythropoietin 

To test whether DE cells could indeed be used for the 
expression of medically important human proteins, we have isolated the 
5 genomic DNA encoding the human EPO gene. We chose to use EPO 
as a model because it is a secreted protein, so we could test whether 
DE cells properly process secreted proteins. We also used a genomic 
clone of EPO instead of the cDNA, to assess whether human genes are 
properly spliced to produce functional mRNAs in DE cells. 

10 DNAs for cloning of EPO were prepared with blood cells 

collected from four people. Human peripheral blood lymphocytes 
were isolated by Ficoll-Hypaque gradient centrifugation of 
heparin-treated blood cells. Total DNA was prepared and used for 
polymerase chain reaction using specific oligonucleotide primers (Fig. 

15 4). The region around the start codon was highly GC rich, so the EPO 
sequence was cloned by two steps of PCR using two different pairs of 
primers. 

To obtain the genomic DNA for EPO, total DNA was prepared by 
lysing human peripheral blood lymphocytes using TES (10 mM Tris-HCI 
20 pH 7.8; 1 mM EDTA; 0.7 % SDS) followed by the treatment with 
400 \ig/m\ proteinase K at 50 °C for 1 hour, phenolxhloroform 
extraction, and ethanol precipitation. The polymerase chain reaction 
(PCR) was performed using 0.1 \ig of total genomic DNA and 
oligonucleotide primers specific to the EPO gene. 

25 Primer #25 (sense, 5 1 to 3'): GAAGCTGATAAGCTGATAACC 

Primer #33 (antisense, 5' to 3'): TGTGACATCCTTAGATCTCA 

The samples were amplified through 30 cycles that included the 
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following parameters; denaturation at 92 °C for 1 min, primer annealing 
at 55 °C for 1min, and primer extension at 72 °C for 1 min. The DNA 
fragment amplified from this reaction did not contain the first 13 
nucleotides in the N-terminal region, so a second PCR was performed 
using the following primers (Underlined, Hindlll; Outlined, start codon 
and stop codons, respectively). The relative position of these primers 
are as shown in Fig. 4. Taq DNA polymerase (POSCO Chem, Korea) 
and pfr polymerase (STRATGENE, California, USA) were used to 
amplify DNA. 

Primer #12 (sense, 5' to 3'): 

CAAGCTTCGGAGATCGGGTGCACGAATGTCCTGCCTGGCTGTGGC 

Primer #9 (antisense, 5' to 3'): 
CAA^CTTTCATCTGTCCCCTGTCCTGC 

The amplified DNA from the second PCR was cloned into the 
pCRII (Invitrogen), from which the Hindlll fragment containing the 
genomic sequence of EPO was inserted into various expression 
vectors as described above. In this experiment, the amplified DNA 
was placed under the control of the HCMV MIEP or SV40 early 
promoter, generating pCMV-gEPO and pSV-gEPO respectively. SY- 
EPO whose amino acid sequence is identical to that of the already 
known EPO is used for the expression experiments in the sections VII 
and VIII (See the section VI). 

VI. Analysis of Nucleotide Sequences of Cloned EPO Genomes 

Genomic structure of EPO cloned by the above method is different from 
the natural EPO genome in vivo. That is, wild type EPO genomic DNA 
has five coding regions and four introns between them. However, in 
the DNA cloned by the above method, the first coding region was fused 
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to the second coding region to form one coding region so that it has 
four coding regions and three introns (Fig. 4). 

The results from the analysis of EPO gene sequences isolated from 
four people suggested that nucleotide sequences of EPO cloned from 

5 these region are significantly different from those of the prior two EPOs 
(AM-EPO and GI-EPO) (Fig. 5) at the sites of intron. Such a 
difference was not due to the error which occurred during DNA 
amplification in the process of cloning. We repeated cloning and 
sequencing using DNAs prepared from same individuals (but at 

10 different times) and obtained the same nucleotide sequence. As 

another control, we amplified the already cloned EPO under the similar 
conditions, and determined the nucleotide sequence. Again, we 
obtained the same nucleotide sequence. 

Amino acid sequences of four EPO genes, together with AM and Gl, 
15 are shown in Fig. 6. Amino acid sequences from AM, Gl and SY are 
identical. However, amino acid sequences from three people (JM, SH, 
HE) different by two or three different amino acids from Gl- and AM- 
EPO, suggesting that there is a polymorphisms among people. When 
compared with AM- or GI-EPO, HE-EPO had three different amino 
20 acids at C-terminal, SH-EPO three different amino acid over the whole 
polypeptide, and JM-EPO two different amino acids, one at C-terminal 
and the other in the middle of polypeptide (See Fig. 6). For example, 
while AM-EPO and GI-EPO had serine, alanine, and valine at positions 
36, 100 and 170 respectively, SH-EPO had arginine, serine, and 
25 tyrosine. Further, while AM-EPO and GI-EPO had valine, lysine, and 
aliginine at positions 170, 177, and 191, HE-EPO had tyrosine, 
glutamine, and glycine. In JM-EPO, lysine and tyrosine were present 
at positions 54 and 170, while they were threonine and valine. These 
results suggested that the EPO gene has a polymorphism in amino 
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acids sequence as well as DNA sequence. 

VII. Expression of EPO in DE Cells 

In this experiment, we compared levels of EPO expression 
between DE cells and other cell lines. 

5 EPO expression vectors were transfected into various cells 

including DE, CEF, CHO, HeLa, VERO, and 293T. We have included 
VERO cells because they are often used for heterologous gene 
expression, and 293T cells which drive very high levels of gene 
expression, presumably due to both the high frequency of DNA 

10 transfection and the presence of potent viral transactivators such as 
EIA, EIB, and large T antigen. Two to three days after transfection, 
levels of EPO in the culture supernatants were measured by the 
enzyme linked immunoadsorbent assay, and transfection efficiencies 
were determined by staining cells adhered on the culture with X-gal. 

15 Transfection efficiency was carried out by transfection of a lacZ 
expression vector together with an EPO expression vector as described 
in the section II. One representative result of this analysis is 
summarized in Table 2. 



Table 2 



Cell 


HCMV MIEP 


SV40 early 
promoter 


HCMV/SV40 


293 


314 


17.5 


18 


CHO 


139.4 


10.4 


13.5 


VERO 


250 


10.7 


23.5 


NIH3T3 


89 


79.4 


1.1 


DE 


4335 


13.8 


314.8 



20 

When the SV40 early promoter was used, there was little 
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difference in the levels of EPO between cell types. However, when 
the HCMV MIEP was used, DE cells produced much higher levels of 
EPO than any other cell lines tested. The HCMV MIEP was much 
more active than the SV40 early promoter in almost all the cells tested. 
5 This difference was especially pronounced in DE cells, where the 
former produced 315 times more EPO than the latter. Among the 
various cell types, DE cells always produced the highest level of EPO. 
CHO cells are the source of cell lines producing EPO that is currently 
used for human application. In this transient system, however, the 

10 level of EPO in CHO cells was at least 30-fold lower than in DE cells. 
Difference between DE and 293T cells was also considerable. 
Transfection efficiency of 293T was higher by about 30-folder than any 
other cells including DE cells. Moreover, 293T cells produce potent 
viral transcription transactivators. Nevertheless, DE produced 10- 

15 folder more EPO than 293T, suggesting that DE could drive high levels 
of the gene expression. 

In conclusion, human EPO could efficiently be produced and 
secreted in DE cells and that the HCMV MIEP is the promoter of choice 
for driving high level heterologous gene expression in DE cells. 

20 In summary, we found that DE cells could produce very high 

levels of bacterial and human proteins. All three promoters tested 
drove higher levels of gene expression in DE cells than any other cell 
lines used in this study. In particular, the HCMV MIEP was extremely 
powerful in DE cells. The high level of heterologous gene expression 

25 observed was not due to a higher number of transfected cells. It 
appears that DE cells properly process splicing and secretion because 
transfection of DE cells with an expression vector containing the EPO 
genomic DNA sequence produced a large quantity of EPO in the 
culture supernatant. 
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For DE cells to be used for industrial purpose, one would need to 
develop large-scale culture techniques for these cells. There are two 
possible ways. First, it may be possible to use primary cells 
themselves as the producer line. A large number of DE cells can 
5 easily be prepared from 10- to 13 day-old duck embryos. From one 
embryo, we cap readily obtain 10 9 to 10 10 cells that can undergo at least 
15 passages. Therefore, it is possible to transfect DE cells at the 
earliest possible stage with an expression vector followed by selection 
of transfected cells, which might require 4-7 passages. Even if less 
10 than 5% of the cells were transfected, a large number of transfected 
cells would be available, suggesting that large-scale culture of primary 
duck embryo cells is not impossible with primary cells. Second, it will 
be possible to transform duck embryo cells at an early stage, using one 
of the large number of well-characterized oncogenes that are available. 
15 With transformed DE cells, a producer line could be constructed, and 
better quality control of protein production be established. It remains 
to be seen whether transformed DE cells will still maintain the capability 
for high level gene expression. Although a number of biological 
questions remain to be answered, the potential of these cells for the 
20 production of various proteins warrants further investigation. 

VIII. Heterologous Gene Expression 
in the Transformed Avian Cell Line 

The above experiments demonstrated the great potential of DE 
cells as producers of heterologous proteins such as EPO. However, 
25 DE cells used in the above experiments are primary cells and stop 
dividing after 30-40 passages in vitro. Therefore, unless DE cells are 
transformed or special techniques are developed as described above, it 
is difficult to use these embryonic cells for industrial production of 
heterologous proteins. 
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In the following study, we tested whether the transformed avian 
cell line, namely the quail fibrosarcoma line, could be used to produce 
EPO. The quail fibrosarcoma line used in this study, QT-VC, was 
subcloned from QT6 (ATCC CRL1708). This line was derived from 

5 methylcholanthrene-induced fibrosarcoma of Japanese quail. QT-VC 
is different from its parental line in at least two aspects. First, QT-VC 
grows faster than the parental line in M199 medium containing 10% 
FBS used in this study. The former divided every 12-24 hours, while 
the doubling time of the latter was 24-36 hours. Second, the QT-VC 

10 cell looks more roundish than QT6 which generally grows in a longish 
form. Like its parental line, QT-VC did not grow well when it was 
seeded at a low density. Therefore, cells had to be split to 1/3 to 1/2 
after reaching confluence for continuous culture. 

1 . Analysis of Gene Expression in QT-VC Cells 

15 We compared the levels of gene expression between QT-VC 

and mammalian cells using pCMV-CAT. We chose to use the HCMV 
MIEP as this promoter was shown to drive high levels of gene 
expression in various cell types including avian cells (See the section 
IV). pCMV-CAT was transfected into 3 cell lines, QT-VC, CHO-K1 and 

20 Vero. To make this comparison semi-quantitative, all transfections 
and CAT assays were performed at the same time and using identical 
conditions. Transfection efficiency was also measured by 
cotransfecting pCM-lacZ followed by X-gal staining. The efficiency 
was approximately 3 % in all cases. Under these conditions, the 

25 levels of CAT expression in QT-VC cells were always 2-3 times higher 
than mammalian cell lines used in this study (Fig. 7). Although the 
level of gene expression in QT-VC cells appears to be lower than DE 
cells, the quail fibrosarcoma line is at least as good as mammalian cell 
lines, suggesting that it could be used as a producer for heterologous 
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proteins. 

2. Construction of EPO Expression Vectors for QT-VC Cells 

To test whether high levels of heterologous proteins could be 
expressed in QT cells, we have constructed other EPO expression 
5 vectors. The basic strategy for the construction of an expression 
vector was as follows: 

First, we chose to use the HCMV MIEP to drive expression of the 
heterologous gene as it had already been shown to be one of the 
strongest promoters in avian cells as well as mammalian cells. 

10 Second, the human glutamine synthetase (GS) gene was used 

for amplification of the target gene. Generally, the gene of interest is 
amplified to augment the yield of protein by using certain selectable 
markers in the presence of specific chemicals. One of the best 
examples is the dihydrofolate reductase (DHFR) gene. It has been 

15 shown that the copy number of the heterologous gene and the level of 
respective protein increase as the concentration of methotrexate (MTX) 
in the medium is slowly increased. However, this system requires the 
host cell defective in the gene DHFR, so cannot be directly applied to 
QT cells for which such a mutant line is not yet available. For this 

20 reason, we chose to use the GS gene. In this case, the host cell line 
need not to be deficient for GS, because only multiple copies of the GS 
gene can confer resistance to methionine sulfoximine (MSX). 

The overall structure of EPO expression vectors constructed for 
the use in QT cells is shown in Fig. 8. In this structure, the cDNA 
25 sequence for EPO is under the control of the HCMV MIEP, the bacterial 
Neo gene is used as the first selectable marker, and the human GS 
gene is also present as the second selectable marker in the same 
plasmid. The backbone of expression vectors used in this particular 
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experiment was pCI-neo (Promega, USA) which uses the HCMV MIEP 
and the intron from the p-globin genome. We have made a couple of 
different constructs in which the GS gene is driven by the partial MMTV 
LTR (from -220 to +15) or the 220 bp HSV tk promoters. In either 
case, the magnitude of gene amplification appears to be comparable 
(Data not shown). Detailed procedure and supplementary data 
regarding the construction of expression vectors is available upon 
request. 

3. Construction of QT-VC Cells Stably Expressing EPO 

To construct QT-VC cell lines constitutively expressing EPO, the 
cells were transfected with an EPO expression vector by a calcium 
phosphate coprecipitation method as described in the section II. 
Three days after transfection, EPO production was confirmed by EILSA 
and transfected cells were treated with G418 (0.8 mg/ml) and MSX (25 
jaM). When G41 8-resistant cells were grown to confluence, cells were 
diluted for sublconing. Because QT-VC cells do not grow efficiently at 
a low cell density, cells were seeded on 10-cm culture dishes at various 
numbers (10 2 , 10 3 , 10 4 , 10 5 per dish). Then the colonies that grew 
distant from other colonies were isolated by plastic O rings and 
expanded onto a 96-well plate. When cells reached 70 % confluence, 
the EPO level was measured. Subclones that produced more than 
200 U/ml were serially expanded from the 12-well to 6-well to 60 mm 
culture plates. When cells reached confluence on a 60 mm dish, cells 
were split on 6-well plates and then treated with various concentrations 
of MSX (100 |iM, 250 juM, 1 mM). Using this procedure, several 
subclones that produced large amounts of EPO and also grew fast 
were selected. 

One of the subclones obtained through this procedure is QT- 
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N4D4. As shown in Fig. 9, this subclone produced 1200 U/ml when 
grown for 3 days after confluence. When the cells were split to 1:3, 
seeded on 10 cm dishes, and allowed to grow for another 3 days, N4D4 
still produced 1000 U/ml. The medium was then replaced with a fresh 
5 media containing 2 % FBS and the cells still produced 400 U/ml EPO. 
These results indicated that QT cells could produce a large quantity of 
EPO. 

In conclusion, the above experiment demonstrated the great 
potential of QT cells as a producer for heterologous protein. 

1 ° IX. Biological Activity of EPO Produced in Avian Cells 

EPO is heavily glycosylated and such glycosylation is required 
for its biological activity. For example, EPO produced in E. coii or 
yeast is inactive or very weakly active in vivo. To test whether EPO 
expressed in DE or QT cells was biologically active, we carried out an in 
15 vitro bioassay using spleen cells isolated from mice treated with 
phenylhydrazine. 

EPO assay: Absolute levels of EPO production after 
transfection of various cells were determined by enzyme linked 
immunoadsorbent assay which is currently used to measure EPO 

20 levels in the human serum (R&D Systems Inc., Minnesota, USA), To 
measure the biological activity of EPO, in vitro bioassay was carried out 
by the method of Krystal as modified by Goldberg et al Spleen cells 
were taken from C57BL X C3H Fl hybrid mice (Seoul National 
University Laboratory Animal Center) on day 3 after the second of two 

25 daily injections of phenylhydrazine (60 mg/Kg of body weight per day) 
and spleen cell suspensions were prepared with Lymphoprep™ 
(NYCOMED PHARMA AS, Oslo, Norway). The spleen cells (final 
concentration 4 X 10 6 cells per ml) were then incubated in 24 well tissue 
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culture plates with various standard doses of EPO (CILAG AG 
International, Switzerland; specific activity 2000 U/ml) or unknown 
samples for 22 hr and then pulsed with 4 uCi/well tritiated thymidine 
(Amersham Co.) for 2-3 hr. The cells were harvested, washed with 
PBS several times and lysed by 0.3 N NaOH and 0.1% SDS. 
Radioactivity in LSC cocktail solutions were calculated by a Pharmacia 
Wallac 1410 scintillation counter. 

Culture supernatants from QT-N4D4 cells or DE cells transfected 
with EPO expression vectors were taken to measure levels of EPO by 
both ELISA and the bioassay. The ELISA measures absolute 
concentration, and is currently used for determining EPO concentration 
in human serum. On the other hand, the bioassay determines 
biological activity using a control EPO that has been produced from 
mammalian cells and is currently being used in humans. Fig. 10 
compares the difference in levels of EPO determined by these two 
methods. The ratio between the values (mU) was I ± 0.15, and the 
specific activity of EPO produced from DE cells was estimated to 105 
U/ug. Therefore, the levels of EPO measured by ELISA were very 
comparable to those obtained by the bioassay. This result suggested 
that EPO produced from these avian cells had a similar biological 
activity to commercially available EPO. 
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What is claimed is: 

-1. An EPO production system comprising: 

a DNA encoding EPO; 

a vector for receiving the DNA; and 

an avian cell for harboring the vector. 

2. The EPO production system of claim 1 , wherein the avian cell is DE or 
CEF or QT. 

3. The EPO production system of claim 2, wherein the QT is QT-VC. 

4. The EPO production system of claim 1 , wherein the DNA is a genomic 
DNA encoding EPO. 

5. The EPO production system of claim 1, wherein the DNA encoding 
EPO is selected from the group consisting of SY. JM, SH and HE described in Fig. 
5. 

6. The production system of claim 1, wherein the vector contains a 
promoter selected from the group consisting of SV40 early promoter, HCMV 
MIEP and RSV LTR. 

7. A method of producing EPO comprising the steps of: 
inserting a DNA encoding an EPO into a vector; 
transfecting the vector into an avian cell; and 
culturing the transfected avian cell in media. 

8. The method of claim 7, wherein the avian cell is DE or CEF or QT. 

9. The method of claim 8, wherein the QT is QT-VC. 

1 0. The method of claim 7, wherein the DNA encoding EPO is a genomic 

DNA. 

11. The method of claim 7, wherein the DNA encoding the EPO is 
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selected from the group consisting of SY, JM, SH and HE described in Fig. 5. 

12. The method of claim 7, wherein the vector contains a promoter 
selected from the group consisting of SV40 early promoter, RSV LTR and HCMV 
MIEP. 

13. An EPO genomic sequence selected from the group consisting of SY, 
JM t SH and HE described in Fig. 5. 

14. An EPO amino acid sequence selected from the group consisting of 
JM, SH and HE described in Fig. 6. 

15. An avian cell as a host for expressing a gene encoding an EPO. 

16. The avian cell of claim 15, wherein the avian cell is DE or CEF or QT. 

17. The avian cell of claim 16, wherein the QT is QT-VC. 

18. A human heterologous protein production system comprising: 
a DNA encoding a human heterologous protein; 

a vector for receiving the DNA; and 
an avian cell for harboring the vector. 

19. The human heterologous protein production system of claim 18, 
wherein the human heterologous protein is selected from the group consisting of 
TPA, Factor VI I land EPO. 

20. A method of producing a human heterologous protein comprising the 
steps of: 

inserting a DNA encoding a human heterologous protein into a vector; 
transfecting the vector into an avian cell; and 
culturing the transfected avian cell in media. 

21 . The method of claim 20, wherein the human heterologous protein is 
selected from the group consisting of TPA; Factor VIII and EPO. 
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ABSTRACT 



Heterologous protein expression system including a 
heterologous gene DNA such as EPO genomic DNA, a vector receiving 
the DNA and an avian cell, such as duck embryo or quail fibrosarcoma 
cell line, expressing the gene in the vector can be used to efficiently 
produce heterologous proteins such as EPO. 
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FIG.5A 



AM 


ATGG6GGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 


50 


GI 


ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 


50 


SY 


ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 


50 


JM 


ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 


50 


SH 


ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 


50 


HE 


ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 


50 



AM 


GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 


100 


GI 


GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 


100 


SY 


GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 


100 


JM 


GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 


100 


SH 


GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 


100 


HE 


GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 


100 








AM 


GTGACAGCtGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 


150 



GI GTGACAGCCGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150 

SY GTGACAGCCGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150 

JM GTGACAGCCGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150 

SH GTGACAGIaCGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150 

HE GTGACAGCCGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150 



AM AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200 

GI AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200 

SY AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200 

JM AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200 

SH AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200 

HE AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200 
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AM AGGGCTTCAGGi 

GI AGGGCTTCAGGGj- 

SY AGGGCTTCAGGG;- 

JM AGGGCTTCAGGG- 

SH AGGGCTTCAGGG- 

HE AGGGCT7CAGGGGA 



AAC i CCTC CCAGpiATC CAGG AAC CTGGC ACTTGG7TT 
AAC I CCTCCCAGj-iATCCAGGAACCTGGCACTTGGTTT 
AAC i CCTCCCAGj-jATCCAGGAACCTGGCACTTGGTTT 
AACTCCTCCCAd-SATCCAGGAACCTGGCACTTGGTTT 
AACTCCTCCCAd-'ATCCAGGAACCTGGCACTTGGTTT 
i ACTCCTCCCAGjSATCCAGGAACCTGGCACTTGGTTT 



248 
248 
248 
248 
248 
250 



AM GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAA.GAATAAGTCT 

G I GGGG l GGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 

SY GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 

JM GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 

SH GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 

HE GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 



A a A A r, A a "X" "A" 'K 'A'JCX* 



298 
298 
298 
298 
298 
300 



AM GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGC.AAAGCCAGCAGA 348 

GI GGTGGCCCCAAA.CCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 3^8 

SY GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348 

JM GGTGGCCCCAAACCA I ACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348 

SH GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348 

HE GG I GGCCCCAAACCA i ACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 350 



AM 
GI 
SY 
JM 
SH 
HE 



TCCTACSGCCTGTGGSCCAGGG^CA^AIGCCTTCAGGGACCCTTGACTCC 
TCCTAG-GCCTGTGdiCCAGGGtiCAfe-AjdcCTTCAGGGACCCTTGACTCC 
TCCTAdGGCCTGTGGdCCAGGGCC/^^CCrrCAGGGACCCTTGACTCC 



rCCTAGqGCCTGTGGGCCAGGGpCAGGAiGCCTTCAGGGACCCTTGACTCC 
5CTACGGCCTGTGdGCCAGGGi3CA|-(iAjGCCTTCAGGGACCCrrGACTCC 
i CC I ACGGCCTGTGGGCCAGGGCCAj-GAjqcCTTCAGGGACCCrrGACTCC 



397 
395 
397 
398 
397 
399 
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AM 
GI 
SY 
JM 
SH 
HE 



AM 
GI 
SY 
JM 
SH 
HE 



CC'GGGCTGTHTGCATTiftAGAEGGGCTGTGCTGAACACTGCAGCnGMT 
CCGGGCTGWGCATTjrCAGACGGGCTGTGCTGAACACTGCAGCTTGAAT 
CCGGGCTGTfljrGCATTTrjCAGACGGGCTGTGCTGAACACTGCAGCTTGAAT 
CCGGGCTGTGh"GCATliTpAGAAGGGCTGTGCTGAACACTGCAGCTTGAAT 
CCGGGCTGllGTGCATTTCAGAfcGGGCTGTGCTGAACACTGCAGCTTGAAT 
CCGGGCTGTpiTGCATTTCpAGACGGGCTGTGCTGAACACTGCAGCTTGAAT 

7cicicic***ie*[^icyc-k~k-k ******** 

GAG(AATATCACTGTCCCAGACACCAAAGTTAAT7TCTATGCCTGGAAGAG 

gayaatatcactgtcccagacaccaaagttaatttctatgcctggaagag 
gaWaatatcactgtcccagacaccaaagttaatttctatgcctggaagag 
ga|j|aatatcactgtcccagacaccaaagttaatttctatgcctggaagag 
gagKatatcactgtcccagacaccaaagttaatttctatgcctggaagag 



gagmtatcactgtcccagacaccaaagttaatttctatgcctggaagag 

I. 



447 
445 
447 
448 
447 
449 
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495 
497 
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497 
499 



AM 
GI 
SY 
JM 
SH 
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AM 
GI 
SY 
JM 
SH 
HE 



GATGGAGGTGAGTTCC 1 1 1 1 1 i i 
GATGGAGGTGAGTTCC 



TT 



TT 



GATGGAGGTGAGTTC CTTTTTTT 
GATGGAGGTGAGTTC CT 
GATGGAGGTGAGTTC CT 
GATGGAGGTGAGTTCCT 



TT 



TT 



l' 'i 



"CCTTTC1 
'CCTTTC1 



TTCCTTTC1 
ITCCTTTC1 



TTT--TCCT 



ii CCT 



7k-***-7C*******^**^*****^****"K' 



TTGGAGAATCT 
TTGGAGAATCT 
TTGGAGAATCT 



TTGGAGAATCT 
L ii TTGGAGAATCT 
iTCTTTTGGAGAATCT 



CATTTGCGAGCCTGATTTfllGGATGAMGGGAGAlAlTGATCGGGGGAAAGGT 
CAmGCGAGCCTGATTTfI]3GATGAMGGGAGAMGATCG^GGGAAAGGT 
CATTTGCGAGCCTGATTTfnSGATGAMGGGAGAAlTGATCGj^GGAAAGGT 
CATTTGCGAGCCTGATTTprijGATGAAAGGGAGAGiTGATCG^GGAAAGGT 
CATTTGCGAGCCTGATTTfTGGATGAMGGGAGAATGATcdAhGGAAAGGT 
CAmGCGAGCCTGATTTGbGATGAMGGGAGAjAiTGATCdAhGGAAAGGT 

yyy^XT^^^yy^yi'y^T-^ |yc*************i jx***y*j_)*r******** 
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FIG.5D 



AM AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCROGTCTA 647 

GI AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCAcbTCTA 645 

SY AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCiAcbTCTA 647 

JM AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCACGTCTA 648 

SK AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCACGTCTA 645 

HE AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCCAjGTCTA 649 

AM TAATCCCAGGCTGAGAfTjGGCCGAiGlATGGGAGAATTGCTTGAGCCCTGGAG 697 

GI TAATCCCAGGCTGAGAiTEGCCGAGlATGGGAGAATTGClTGAGCCCTGGAG 695 

SY TMTCCCAGGCTGAGATGGCCGaIaKTGGGAGAATTGCTTGAGCCCTGGAG 697 

JM TMTCCCAGGCTGAGATCGCCGAGiATGGGAGAATTGCTTGAGCCCTGGAG 698 

SH TMTCCCAGGCTGAGAfciGGCCGAjSATGGGAGAATTGCTTGAGCCCTGGAG 695 

HE TMTCCCAGGCTGAGAnpGCCGAGATGGGAGAATTGCTTGAGCCCTGGAG 699 

AM GTTCAGACCAACCTAGGCAGOATAGTGAGATCCCCCATCTCTACAAACAT 747 

GI GTTCAGACCAACCTAGGCAGC'AiTAGTGAGATCCCCCATCTCTACAMCAT 747 

SY GTTCAGACCAACCTAGGCAGCjAirAGTGAGATCCCCCATCTCTACAAACAT 747 

JM GTTCAGACCAACCTAGGCAGCAItaGTGAGATCCCCCATCTCTACAAACAT 748 

SH GTTCAGACCAACCTAGGCAGCkiAGTGAGATCCCCCATCTCTACAAACAT 745 

HE GTTCAGACCAACCTAGGCAGCCTAGTGAGATCCCCCATCTCTACAAACAT 749 



AM TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 797 

GI TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 795 

SY TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 797 

JM TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 798 

SH TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 795 

HE TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 799 



0 ?~t> 
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FIG.5E 



AM TGG4AfeGCTGAGGCGGGAGGATCGCTTGAGCCCAGGAATTTGiA]3GCTGC^ 847 

GI IGG/^GCTGAGGCGGGAGGATCGCTTGAGCCCAGGAATTTGAGGCTGdA 845 

SY TGG^nGGCTGAGGCGGGAGGATCGCTTGAGCCCAGGAATTTdAbGCTGCA 847 

JM TGG^jGGCTGAGGCGGGAGGATCGCTTGAGCCCAGGAATTTGkGGCTGdG 848 

SH TGG/^GCTGAGGCGGGAGGATCGCTTGAGCCCAGGAATTTGGGGCTGCiAi 845 

HE TGG^|3GCTGAGGCGGGAGGATCGCTTGAGCCCAGGMTTTGy\hGCTGckj 849 

AM GTGAGCTGTGATCACACCACTGCApTCCAGCCTCAGTGACAGAQTGAGGC 897 

GI GTGAGCTGTGATCACACCACTGCAClTCCAGCCTCAGTGACAGAiSTGAGGC 895 

sy gtgagctgtgatcacaccactgcacjccagcctcagtgacagawgaggc 897 

jm gtgagctgtgatcacaccactgcajcitccagcctcagtgacagactgaggc 898 

sh gtgagctgtgatcacaccactgcas^tccagcctcagtgacagagtgaggc 895 

he gtgagctgtgatcacaccactgcajc|rccagcctcagtgacagajgnrgaggc 899 

am cctgtctcaaaaawsaamgaaaamgmaaatmtgagggctgtatgga 947 

gi cctgtctcaaaaajahaaaagaaaaaagaaaaaf^tgagggctgtatgga 945 

sy cctgtctcaaaaa^aamg.aaaaaagaaaaatWatgagggctgtatgga 947 

jm cctgtctcaaaaai^aamgaaaaaagaaaaafalatgagggctgtatgga 948 

sh cctgtctcaaaaatgaaaagaaaaaagaaaaafaatgagggctgtatgga 945 

he cc i gtc i caaaaakjbaaaagaaaaaagaaaaatjratgagggctgtatgga 949 

AM ATACftTTCATTATTCATTCACTCACTCACTCACTCAlltCATTCATTCATT 997 

GI ATAdGTTCATTATTCATTCACTCACTCACTCACTCATiTCATTCATTCATT 995 

SY ATACAnTCATTATTCATTCACTCACTCACTCACTCATCCATTCATTCATT 997 

JM ATACjATTCATTATTCAnCACTCACTCACTCACTCAlirCATTCATTCATT 998 

SH ATACiAnCATTAnCATTCACTCACTCACTCACTCATiTCAnCATTCATT 995 

HE ATACATTCATTATTCATTCACTCACTCACTCACTCAWpATTCATTCATT 999 
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FIG.5F 



AM CATTCMC-^TCTTATTGCATACCTTCTGTTTGCTCAGCTTGGTGCTirG 1047 

GI CATTCAACATpTCTTATTGCATACCTTCTGTTTGCTCAGCTTGGTGCTjrh 1045 

SY CATTCMCAlAjGTClTATTGCATACCTTCTGTTTGCTCAGCTTGGTGCTcb 1047 

JM CATTCMCAlAiGTCnATTGCATACCTTCTGTTTGCTCAGCTTGGTGCmG 1048 

SH CATTCAACAW3TCTTAnGCATACCTTCTGTTTGCTCAGCTTGGTGCm 1045 

HE CATrCMCAi^TCTTA—GCATACClTCTGMTGCTCAGCTTGGTGCTjrG 1049 

AM GGSCTppTGAGGGGCAGGAGGG^AGGGTGACATGGGTCAOZTGACTCCC 1097 

GI GGGC i pCTGAGGGGCAGGAGGGAGAGGGTGACATCCOTCAGCTGACTCCC 1095 

SY GGGCiGCTGAGGGGCAGGAGGGiAGAGGGTGACATGGGTCAGCTGACTCCC 1097 

JM GGGCTGCTGAGGGGCAGGAGGG'TGAGGGTGACATGGGTCAGCTGACTCCC 1098 

SH GGlCCTirCTGAGGGGCAGGAGGGAGAGGGTGACATGGGTCAGCTGACTCCC 1095 

HE GGGCTlGCTGAGGGGCAGGAGGGABAGGGTGACATCGGTCAAjCTGACTCCC 1099 

AM AGAGTCCACTCCCTGTAGGTCGGGCAGCAGGCCGTAGAAGTCTGGCAGGG 1147 

GI AGAGTCCACTCCCTGTAGGTCGGGCAGCAGGCCGTAGAAGTCTGGCAGGG 1145 

SY AGAGTCCACTCCCTGTAGGTCGGGCAACAGGCCGTAGAAGTCTGGCAGGG 1147 

JM AGAGTCCACTCCCTGTTTEGTCGGGCAGCAGGCCGTAGAAGTCTGGCAGGG 1148 

SH AGAGTCCACTCCCTGTAGGTCGGGCAGCAGGCCGTAGAAGTCTGGCAGGG 1145 

HE AGAGTCCACTCCCTGTAGGTCGGGCAGCAGGCCGTAGAAGTCTGGCAGGG 1149 

AM CCTGGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1197 

GI CCTGGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1195 

SY CC i GGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1197 

JM CCTGGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1198 

SH CC l GGCCCTGCTGTCGGAATCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA U95 

HE CC I GGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1199 
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FIG.5G 



AM ACTCfTTCCCASCCGTGGGAGCCCCTGCAGCTGCATGTGGATAMGCCGTC 12*7 

GI ACTCTTCCCA)±CGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1245 

SY ACTpiTTCCCAjGbCGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1247 

JM ACTOTTCCCAGpCGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1?48 

SH ACTCTTCCCAiAjCCGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 12*5 

HE ACTGrrCCCAGCCGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1249 

AM AGTGGCCTTCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAGGT 1297 

Gi AGTGGCCTTCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAGGT 1295 

SY AGTGGCCTTCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAGGT 1297 

JM AGTGGCCTTCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAGGfr 1298 

SH AGTGGCCTTCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAG,TjT 1295 

HE AGTGGCCTTCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAGGfT 1299 

AM GAGTAGGAGC|3GACACTTCTGCTTGCCCTTTpTGTAAGAA.GGGGAGAAGG 1347 

GI GAGTAGGAGteGACACTTCTGCTTGCCCTTrcrrGTAAGAAGGGbAGAAGG 1345 

SY GAGTAGGAGCGGACACTTCTGCTTGCCCTTrCTGTMGAAGGGbAGAAGG 1347 

JM GAGTAGGAGCGGACACTTCTGCTTGCCCTTTCTGTAAGAAGGGGAGAAGG 1348 

SH GAGTAGGAGGGGACACTTCTGCTTGCCCTTTGTGTAAGAAGGAbAGAAGG 1345 

HE GAGiAGGAGCGGACACTTCTGCnGCCCTTTCTGTMGMGGGGAGAAGG 1349 

AM GTCTTGCTAAGGAGTACAGGAtAlCTGTCCGTATTCCTTCCCTTTCTGTGGC 1397 

GI GTCTTGCTAAGGAGTACAGGAAjCTGTCCGTATTCCTTCCCTTTCTGTGGC 1395 

SY GTCTTGCTAAGGAGTACAGGAAjCTGTCCGTATTCCTTCCCTTTCTGTGGC 1397 

JM GlCTTGCTAAGGAGTACAGGAS^TGTCCGTATTCCTTCCCTTTCTGTGGC 1398 

SH GTCTTGCTAAGGAGTACAGGA^jCTGTCCGTATTCCTTCCCTTTCTGTGGC 1395 

HE G I CTTGCTAAGGAGTACAGGATCTGTCCGTATTCCTTCCCTTTCTGTGGC 1399 
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FIG.5H 



AM ACTGCAGCGACdfpCTGTTTTCTCCTTGGCAGAAGGAAGCCATCTCCCCT 1447 

GI ACTGCAGCGACCfrCCTGTTTTCTCCTTGGCAGAAGGAAGCCATCTCCCCT 1445 

SY ACTGCAGCGACQTCCTGTTTTCTCCTTGGCAGAAGGAAGCCATCTCCCCT 1447 

JM ACTGCAGCGACditCTGTTTTCTCCTTGGCAGAAGGAAGCCATCTCCCCT 1448 

SH ACTGCAGCGACuTCCTGTiTTCTCCTTGGCAGAAGGAAGCCATCTCCCCT 1445 

HE ACTGCAGCGACQACCTGTTrTCTCCTTGGCAGAAGGAAGCCATCTCCCCT 1449 



AM CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGADACTTT 1497 

GI CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGAEIACTTT 1495 

SY CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGACACTTT 1497 

JM CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGAC'ACTTT 1498 

SH CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGAiTACTTT 1495 

HE CCAGATGCGGCCTCAGCTGCTCCACTCCGMCAATCACTGCTGAGACTTT 1499 

AM CCGCAAACTCTTCCGAGTCTACTCCAATTTCCTCCGGGGAiAiAGCTGAAGC 1547 

GI CCGCAAACTCTTCCGAGTCTACTCCAATTTCCTCCGGGGA!aKgCTGAAGC 1545 

SY CCGCAAACTCTTCCGAGTCTACTCCAA.TTTCCTCCGGGGAAAGCTGAAGC 1547 

JM CCGCAAA.CTCTTCCGAGTCTACTCCAATTTCCTCCGGGGaKaGCTGAAGC 1548 

SH CCGCAAACTCTTCCGAGTCTACTCCAATTTCCTCCGGGGAAAGCTGAAGC 1545 

HE CCGCAMCTCTTCCGAGTCTACTCCMITTCCTCCGGGGAGAGCTGAAGC 1549 

AM TGTACACAGGGGAGGCCTGCAGGACAGGGGAQAGATGA 1584 

GI TGTACACAGGGGAGGCCTGCAGGACAGGGGAGAJGATGA 1582 

SY TGTACACAGGGGAGGCCTGCAGGACAGGGGACAGATGA 1585 

JM TGTACACAGGGGAGGCCTGCAGGACAGGGGACAGATGA 1585 

SH TGTACACAGGGGAGGCCTGCAGGACAGGGGAClAGATGA 1583 

HE TGTACACAGGGGAGGCCTGCAGGACAGGGGACGGATGA 1586 
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FIG.6 



AM/GI MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDSRVIERYLLEAKEAE 50 

SY MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDiS|RVLERYLLEAKEAE 50 

JM MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDj^RVLERYLLEAKEAE 50 

SH MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDRSRVLERYLLEAKEAE 50 

HE MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDjsjRVLERYLLEAKEAE 50 



AM/GI 

SY 

JM 

SH 

HE 



nitjttgcaehcslnenitvpotkvnfyawkrmevgqqavevwqglallsepy 
nitpriscaehcslnenitvpdtkvnfyawkrmevgqqavevwqglallse-a] 
nitkgcaehcslnenitvpdtkvnfyawkrmevgqqavevwqglallseN 
nitjtgcaehcslnenitvpdtkvnfyawkrmevgqqavevwqglallses 
nitttgcaehcslnenitvpdtkvnfyawkrmevgqqavevwqglallse!a 



100 
100 
100 
100 
100 



AM/GI • VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 

SY VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 

JM VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 

SH VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 

HE VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 



"X - ■K"x"3*c ~k "k" "x *k "x - A a a' a' a' *x/ 



AM/GI AASAAPLRTITADTFRKLFRW5NFLRG|CLKLYTGEACRTGDR 

SY AASAA?LRTITADTFRKLFRV!VSNFLRGKLKLYTGEACRTGDR 

JM AASAAPLRTITADTFRKLFRVYSNFLRGKLKLYTGEACRTGDR 

SH AASAAPLRTITADTFRKLFRVY'SNFLRGKLKLYTGEACRTGDR 

HE AASAAPLRTITADTFRKLFRV^SNFLRGELKLYTGEACRTGDGi 

x: ^^^^•^^Tr^^-x-x'-X'-x'Lj 
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193 
193 
193 
193 
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FIG. 7 
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FIG.8 
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FIG. 9 
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FIG. 10 
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